Search CORE

62 research outputs found

Cross-species comparison of genome-wide expression patterns

Author: Gibson Greg
Zhou Xianghong Jasmine
Publication venue: BioMed Central
Publication date: 21/06/2004
Field of study

The rapid accumulation of microarray data from multiple species provides unprecedented opportunities to study the evolution of biological systems. Recent studies have used cross-species comparisons of expression profiles to annotate gene functions, to draw evolutionary inferences concerning specific biological processes and to study the global properties of expression networks

PubMed Central

University of Queensland eSpace

Integrative missing value estimation for microarray data

Author: Hu Jianjun
Li Haifeng
Waterman Michael S
Zhou Xianghong Jasmine
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. RESULTS: We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests. CONCLUSION: We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Scholar Commons - Institutional Repository of the University of South Carolina

An integrative modular approach to systematically predict gene-phenotype associations

Author: Dai Chao
Mehan Michael R
Nunez-Iglesias Juan
Waterman Michael S
Zhou Xianghong Jasmine
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. Results We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. Conclusion We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Joint Genome-Wide Profiling of miRNA and mRNA Expression in Alzheimer's Disease Cortex Reveals Altered miRNA Regulation

Author: Caleb E. Finch
Chun-Chi Liu
Juan Nunez-Iglesias
Stefan Maas
Todd E. Morgan
Xianghong Jasmine Zhou
Publication venue: Public Library of Science
Publication date: 01/02/2010
Field of study

Although microRNAs are being extensively studied for their involvement in cancer and development, little is known about their roles in Alzheimer's disease (AD). In this study, we used microarrays for the first joint profiling and analysis of miRNAs and mRNAs expression in brain cortex from AD and age-matched control subjects. These data provided the unique opportunity to study the relationship between miRNA and mRNA expression in normal and AD brains. Using a non-parametric analysis, we showed that the levels of many miRNAs can be either positively or negatively correlated with those of their target mRNAs. Comparative analysis with independent cancer datasets showed that such miRNA-mRNA expression correlations are not static, but rather context-dependent. Subsequently, we identified a large set of miRNA-mRNA associations that are changed in AD versus control, highlighting AD-specific changes in the miRNA regulatory system. Our results demonstrate a robust relationship between the levels of miRNAs and those of their targets in the brain. This has implications in the study of the molecular pathology of AD, as well as miRNA biology in general

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

An integrative approach to characterize disease-specific pathways and their coordination: a case study in cancer

Author: Kao Ming-Chih J
Nevins Joseph R
Nunez-Iglesias Juan
West Mike
Xu Min
Zhou Xianghong Jasmine
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

BACKGROUND: The most common application of microarray technology in disease research is to identify genes differentially expressed in disease versus normal tissues. However, it is known that, in complex diseases, phenotypes are determined not only by genes, but also by the underlying structure of genetic networks. Often, it is the interaction of many genes that causes phenotypic variations. RESULTS: In this work, using cancer as an example, we develop graph-based methods to integrate multiple microarray datasets to discover disease-related co-expression network modules. We propose an unsupervised method that take into account both co-expression dynamics and network topological information to simultaneously infer network modules and phenotype conditions in which they are activated or de-activated. Using our method, we have discovered network modules specific to cancer or subtypes of cancers. Many of these modules are consistent with or supported by their functional annotations or their previously known involvement in cancer. In particular, we identified a module that is predominately activated in breast cancer and is involved in tumor suppression. While individual components of this module have been suggested to be associated with tumor suppression, their coordinated function has never been elucidated. Here by adopting a network perspective, we have identified their interrelationships and, particularly, a hub gene PDGFRL that may play an important role in this tumor suppressor network. CONCLUSION: Using a network-based approach, our method provides new insights into the complex cellular mechanisms that characterize cancer and cancer subtypes. By incorporating co-expression dynamics information, our approach can not only extract more functionally homogeneous modules than those based solely on network topology, but also reveal pathway coordination beyond co-expression

Crossref

Springer - Publisher Connector

PubMed Central

University of Melbourne Institutional Repository

Gene Aging Nexus: a web database and data mining platform for microarray data on aging

Author: Chiu Chi-Hsien
Finch Caleb E.
Kamath Kiran
Mehan Michael R.
Nunez-Iglesias Juan
Pan Fei
Pulapura Sudip
Waterman Michael S.
Zhang Kangyu
Zhou Xianghong Jasmine
Publication venue: Oxford University Press
Publication date: 07/11/2006
Field of study

The recent development of microarray technology provided unprecedented opportunities to understand the genetic basis of aging. So far, many microarray studies have addressed aging-related expression patterns in multiple organisms and under different conditions. The number of relevant studies continues to increase rapidly. However, efficient exploitation of these vast data is frustrated by the lack of an integrated data mining platform or other unifying bioinformatic resource to enable convenient cross-laboratory searches of array signals. To facilitate the integrative analysis of microarray data on aging, we developed a web database and analysis platform ‘Gene Aging Nexus’ (GAN) that is freely accessible to the research community to query/analyze/visualize cross-platform and cross-species microarray data on aging. By providing the possibility of integrative microarray analysis, GAN should be useful in building the systems-biology understanding of aging. GAN is accessible at

Crossref

PubMed Central

University of Melbourne Institutional Repository

Recommended from our members

DiseaseConnect: a comprehensive web server for mechanism-based disease–disease connections

Author: Chaudhary Preet M.
Chen Jeremy J. W.
Crandall Edward
Li Wenyuan
Liu Chun-Chi
Loscalzo Joseph
Mayzus Ilya
Rzhetsky Andrey
Sun Fengzhu
Tseng Yu-Ting
Waterman Michael
Wu Chia-Yu
Zhou Xianghong Jasmine
Publication venue: 'Oxford University Press (OUP)'
Publication date: 03/06/2014
Field of study

The DiseaseConnect (http://disease-connect.org) is a web server for analysis and visualization of a comprehensive knowledge on mechanism-based disease connectivity. The traditional disease classification system groups diseases with similar clinical symptoms and phenotypic traits. Thus, diseases with entirely different pathologies could be grouped together, leading to a similar treatment design. Such problems could be avoided if diseases were classified based on their molecular mechanisms. Connecting diseases with similar pathological mechanisms could inspire novel strategies on the effective repositioning of existing drugs and therapies. Although there have been several studies attempting to generate disease connectivity networks, they have not yet utilized the enormous and rapidly growing public repositories of disease-related omics data and literature, two primary resources capable of providing insights into disease connections at an unprecedented level of detail. Our DiseaseConnect, the first public web server, integrates comprehensive omics and literature data, including a large amount of gene expression data, Genome-Wide Association Studies catalog, and text-mined knowledge, to discover disease–disease connectivity via common molecular mechanisms. Moreover, the clinical comorbidity data and a comprehensive compilation of known drug–disease relationships are additionally utilized for advancing the understanding of the disease landscape and for facilitating the mechanism-based development of new drug treatments

Harvard University - DASH

National Chung Hsing University Institutional Repository

PubMed Central

Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation

Author: A Ruepp
A Smilde
AA Tsay
AJ Butte
AL Barabasi
AL Yuille
AY Ng
B Breitkreutz
BP Kelley
C Faloutsos
CHQ Ding
Chun-Chi Liu
D Achlioptas
D Tao
DJ Thomas
E Acar
F Pan
FRK Chung
H Chen
H Hu
Haifeng Li
I Bernales
J Flannick
J Sun
J Sun
J Sun
JA Papin
JJ Hopfield
Jörg Stelling
K Kuwahara
K Takahashi
K Toeda
KA Allen
L Mao
L Omberg
LR Tucker
M Ashburner
M Kalaev
M Kanehisa
M Koyuturk
M Koyuturk
M Nicolás
M Xu
M Xu
MA Serrano
MEJ Newman
Michael S. Waterman
MR Mehan
MW Mahoney
N Genkai
O Alter
O Alter
O Alter
RB Cattell
S Arora
S Miard
T Zhang
T Zhang
TG Kolda
Tong Zhang
TS Motzkin
TW Anderson
U Luxburg
V Spirin
W Li
Wenyuan Li
X Yan
X Yan
X Zhou
Xianghong Jasmine Zhou
Y Huang
Y Yu
YP Deniélou
Publication venue: Public Library of Science
Publication date: 01/06/2011
Field of study

The rapid accumulation of biological networks poses new challenges and calls for powerful integrative analysis tools. Most existing methods capable of simultaneously analyzing a large number of networks were primarily designed for unweighted networks, and cannot easily be extended to weighted networks. However, it is known that transforming weighted into unweighted networks by dichotomizing the edges of weighted networks with a threshold generally leads to information loss. We have developed a novel, tensor-based computational framework for mining recurrent heavy subgraphs in a large set of massive weighted networks. Specifically, we formulate the recurrent heavy subgraph identification problem as a heavy 3D subtensor discovery problem with sparse constraints. We describe an effective approach to solving this problem by designing a multi-stage, convex relaxation protocol, and a non-uniform edge sampling technique. We applied our method to 130 co-expression networks, and identified 11,394 recurrent heavy subgraphs, grouped into 2,810 families. We demonstrated that the identified subgraphs represent meaningful biological modules by validating against a large set of compiled biological knowledge bases. We also showed that the likelihood for a heavy subgraph to be meaningful increases significantly with its recurrence in multiple networks, highlighting the importance of the integrative approach to biological network analysis. Moreover, our approach based on weighted graphs detects many patterns that would be overlooked using unweighted graphs. In addition, we identified a large number of modules that occur predominately under specific phenotypes. This analysis resulted in a genome-wide mapping of gene network modules onto the phenome. Finally, by comparing module activities across many datasets, we discovered high-order dynamic cooperativeness in protein complex networks and transcriptional regulatory networks

Crossref

Directory of Open Access Journals

PubMed Central

Frequent Pattern Discovery in Multiple Biological Networks: Patterns and Algorithms

Author: A Barabasi
A Ruepp
AA Tsay
AJ Butte
AJ Butte
B Kelley
B Suman
D Achlioptas
H Hermjakob
H Hu
Haifeng Li
Haiyan Hu
J Gollub
J Papin
JA Mitchell
Juan Nunez-Iglesias
L Breiman
LF Wu
M Ashburner
M Koyutürk
M Koyutürk
M Koyutürk
M Xu
MA Serrano
MEJ Newman
Michael R. Mehan
Min Xu
MR Mehan
R Edgar
R Sharan
S Arora
S Kirkpatrick
T Zhang
V Krishna
W Li
Wenyuan Li
X Yan
X Zhou
Xianghong Jasmine Zhou
Xifeng Yan
Y Collette
Y Huang
Yu Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recommended from our members

Methylation extends the reach of liquid biopsy in cancer detection

Author: Li Wenyuan
Zhou Xianghong Jasmine
Publication venue: eScholarship, University of California
Publication date: 01/11/2020
Field of study

Measuring the methylation status of cell-free DNA (cfDNA) in plasma holds great potential for the early, noninvasive detection of cancer. Two recent papers published in Nature Medicine showcase the successful application of cfDNA methylation- based cancer detection to two highly challenging scenarios

eScholarship - University of California